Title: A process for site-directed integration of multiple copies of a gene in a mould

The invention relates to a process for site-directed
integration of multiple copies of a gene in a mould, to a
transformed mould obtainable by such process, to a process
for culturing such transformed mould, and to a process for
producing and optionally secreting a desired protein by
culturing such transformed mould. In particular, the
invention provides a process for preparing a protein by a mould
transformed by multicopy integration of at least one
expressible gene comprising a structural gene encoding a
desired protein into the genome of a mould, especially of
moulds belonging to the genus Aspergillus.

In this specification the expression "expressible gene"
means a structural gene encoding a protein, either
homologous or heterologous to the host organism, in
combination with DNA sequences for proper transcription and
translation of the structural gene, and optionally with
secretion signal DNA sequences, which DNA sequences should
be functional in the host mould. Further, in this
specification the expressions "mould" and "filamentous
fungus" are considered as synonyms.

Background of the invention

1. Filamentous fungi, and especially species such as
Aspergillus awamori, Aspergillus niger, Trichoderma reesei
and Fusarium graminearum, have been shown to be attractive
hosts for large-scale production of homologous and
heterologous proteins. They have the capacity to secrete
substantial amounts of protein into the medium, large-scale
fermentation is generally well established, and most of them
have a GRAS (Generally Recognized As Safe) status, which
makes it possible to use these species in the food and
food-processing industry. Moreover, the mould Fusarium
graminearum A 3/5, the Quorn® myco-protein fungus, has also
been used as a commercial human food source in the UK for
over 10 years (Royer et al.; Bio/Technology 13 (1995)
1479-1483).

The production of fungal proteins, of either homologous or
heterologous origin, by filamentous fungi is usually very
efficient, and production levels of grams per litre have been
reached. Compared to this, however, the production levels of
heterologous proteins of mammalian, bacterial or plant
origin in moulds are relatively low. In order to improve
the production of both homologous and heterologous proteins
several strategies have been developed. The basic strategy
that is commonly applied to achieve higher protein
production in moulds is the introduction of multiple copies
of the gene encoding the desired protein.

2. Whereas moulds have been successfully used for the
production of enzymes, antibody fragments and peptides at
laboratory and commercial scale (xylanase, pectinase, etc.),
the acceptance of products from these genetically modified
organisms (GMO) in the market has experienced some
unexpected difficulties in the past few years.

(a) In general there is a growing concern about the use
of antibiotic resistance genes in genetically modified
organisms. The main reason for this concern is the
possibility that such a gene might be transferred into and
expressed in gut micro-organisms, which would thereby
become antibiotic resistant ("Report on the use of
antibiotic resistance markers in genetically modified food
organisms" published by the Advisory Committee on Novel
Foods and Processes, Ministry of Agriculture, Fisheries and
Food, England, 1994).

(b) Further, the presence of other foreign DNA such as 
remnants of vector DNA used in cloning is also undesired. 

(c) Another concern is the fact that in general the
genetically modified strains contain randomly integrated
genetic material. In the perception of some consumer
organisations this would constitute an unpredictable safety
risk, and could mean a barrier to the acceptance of
derived products.

3. Therefore, the recombinant mould should ideally
contain multiple copies of the gene encoding the desired
protein integrated at only a predetermined locus in the
genome, and no other foreign DNA should be present, in order
to produce proteins in moulds both in an economically
attractive manner and in a way that deals with the concerns
about genetically modified organisms as described above.

The generation of mould strains that meet these criteria
has not been reported in the literature.



The commonly applied system for integration of single or
multiple copies of a gene into the genome of moulds, e.g.
Aspergillus, Trichoderma and Fusarium graminearum, makes use
of plasmids which, in addition to the gene encoding a
desired protein, contain bacterial marker genes encoding
resistance to antibiotics (e.g. ampicillin) and other
vector sequences. Therefore, genetically modified moulds
will usually contain antibiotic resistance genes and other
vector DNA.

Whereas targeted integrations of a single gene copy have
been described regularly (e.g. Timberlake, "Gene Cloning
and Analysis" (Chapter 3) in the book "More Gene
Manipulations in Fungi" (1991) 51-85, edited by Bennett and
Lasure; Gouka et al. Applied and Environmental Microbiology
62 (1996) 1951-1957), it has proven to be very
difficult to obtain mould strains that contain multiple
gene copies integrated at a predetermined locus in the
genome. Gouka et al. (Curr. Genet. 27 (1995) 536-540)
reported the selection of targeted multi-copy integrations
at the pyrG locus in A. awamori, but the recombinant
strains were obtained from transformations in which DNA was
used containing vector sequences, and no information was
presented on the number of gene copies that were integrated
at the pyrG locus. For Aspergillus nidulans a similar
observation on targeted tandem integration at the argB
locus was published (Van den Hondel and Punt, "Gene
transfer systems and vector development" (Chapter 1) in the
book "Applied Molecular Genetics" (1991) 1-28, edited by
Peberdy et al.).

Several other publications indicate that site-directed
integration of multiple gene copies could not be obtained,
although it was desired for scientific or commercial
purposes (Kubicek-Pranz et al. J. of Biotech. 20 (1991)
83-94; Van den Hondel et al. Antonie van Leeuwenhoek 61
(1992) 153-160; Verdoes et al. Transgenic Research 2 (1993)
84-92; Archer et al. Antonie van Leeuwenhoek 65 (1994)
245-250; Van Gemeren et al. Applied Microbiology and
Biotechnology 45 (1996) 755-763; Van Gemeren, "Expression
and secretion of defined cutinase variants by Aspergillus
awamori" (Chapter 5) Thesis University of Utrecht (1997)
ISBN 90-393-1229-X).

4. Previously, two processes have been described in the
literature that, in principle, might allow the generation
of mould strains that contain multiple copies of a gene
that are integrated at a predetermined locus in the genome
without the presence of other foreign DNA.

The first process describes the preparation of a protein by
a fungus transformed by site-directed multicopy integration
of an expression vector in the ribosomal DNA locus of the
fungal genome as described in International PCT patent
application WO-A-91/00920; Unilever, published 24 January
1991. Although the Examples were carried out with yeasts,
it was envisaged that such process is also applicable to
moulds. Thus such process could make it possible to
construct a mould strain in which multiple copies of a gene
are integrated at a predetermined locus of the genome,
without the presence of other foreign DNA.

However, transformation of moulds follows a somewhat
different pattern than the transformation of yeasts.
Whereas in the yeast Saccharomyces cerevisiae transforming
DNA is integrated into the genome of the cell via
homologous recombination at the corresponding homologous
site, in filamentous fungi such as the mould Aspergillus
awamori DNA integrates mainly via illegitimate
recombination at random sites in the genome (Finkelstein,
"Transformation" Chapter 6 in the book "Biotechnology of
Filamentous Fungi" (1992) 113-156, edited by Finkelstein
and Ball). For instance, for the mould A. awamori Gouka et
al. (Curr. Genet. 27 (1995) 536-540) performed an analysis
on a large number of transformants and showed that DNA
integrated via homologous recombination in approximately
10 % of the transformants, whereas in the remaining 90 % it
integrated randomly. This means that transformants have to
be screened for site-directed integration events.
Therefore, a process for transformation of moulds as
described in WO-A-91/00920 would require lengthy screening
procedures because DNA that is introduced into the mould
cell can also integrate randomly and not only via
homologous recombination at the predetermined site.
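
The screening burden implied by these figures can be estimated with a short calculation. The sketch below is illustrative only: it assumes that each transformant is an independent event with a fixed probability of homologous integration (the approximately 10 % reported above), and the function names and the 95 % target are hypothetical choices, not taken from the cited work.

```python
import math


def prob_at_least_one_targeted(n_screened: int, p_targeted: float = 0.10) -> float:
    """Probability that at least one of n screened transformants carries
    a site-directed (homologous) integration, assuming independence."""
    return 1.0 - (1.0 - p_targeted) ** n_screened


def transformants_needed(confidence: float, p_targeted: float = 0.10) -> int:
    """Smallest number of transformants to screen so that the probability
    of finding at least one targeted integrant reaches `confidence`."""
    if not 0.0 < confidence < 1.0 or not 0.0 < p_targeted < 1.0:
        raise ValueError("confidence and p_targeted must lie strictly between 0 and 1")
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_targeted))


if __name__ == "__main__":
    # Assumed 95 % target; the 10 % targeting frequency is the figure cited above.
    print(transformants_needed(0.95))
    print(round(prob_at_least_one_targeted(10), 3))
```

Under these assumptions the required number of transformants grows quickly as the targeting frequency drops, which is the practical motivation for enriching site-directed events rather than screening for them.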






The second process describes the site-directed integration
of a single gene copy whereby any other heterologous DNA
used for cloning and any heterologous mould selection
marker are removed, as described in European patent
application EP-A1-0 635 574; Gist-brocades N.V., published
25 January 1995.

If this second process were used for preparing a
transformed mould containing multiple gene copies, the process
would be very cumbersome, because the whole process would need
to be repeated for each subsequent copy that needs to be
introduced.

Although the repetition of the second process for obtaining
multicopies is mentioned as simple statements in the
specification (see e.g. page 3, lines 22-23, page 6, lines
21-25, page 7, lines 29-30, and page 8, lines 9-11 and
18-19), it was not shown in the Examples that it really works.
In fact the statement "sequential application of the same
technology" mentioned on page 8, lines 9-11 confirms the
laborious character of this method for introducing multiple
gene copies at predetermined loci, covering both a single
site and multiple sites.

A further disadvantage of the method described is the risk
that the earlier introduced desired foreign DNA is removed
during a subsequent repetition of the process.

In summary, items 1-4 above show that there exists a need
in the field of mould biotechnology to construct mould
strains containing multiple copies of a gene encoding a
desired protein that are integrated at a predetermined
locus in the genome and that are free of bacterial
antibiotic resistance genes or of other foreign DNA such as
remnants of vector DNA used in cloning. Ideally, the
recombinant microorganism should only contain the
heterologous gene encoding the desired protein.

Summary of the invention 

The invention is applicable in the field of mould
biotechnology and provides a new and more advanced process
for site-directed integration of multiple copies of a gene
in a mould without leaving any undesired DNA, i.e. without
leaving in the transformed mould the selection marker used
for selection of transformants or other DNA used for
cloning. The invention is based on the specific
introduction of a double-strand break at the chromosomal
target in the mould cell, which significantly enhances
site-directed integration at that locus. Repair of the break
with a repair DNA homologous to the regions flanking the
break and including multiple copies of at least one gene
encoding at least one desired protein will lead to
simultaneous integration of those multiple copies at the
locus of the break.

The present invention provides a process for transforming a
mould, in which

(1) multiple copies of a desired gene are integrated in
    the chromosome of said mould,
(2) the integration in the mould genome is site-directed
    via homologous recombination, in contrast to the usual
    random integration in moulds,
(3) such site-directed integration event is selected
    preferentially over any possible random integration
    event, e.g. by selecting for the restoration of a
    defective marker gene,
(4) remaining foreign DNA sequences, e.g. antibiotic
    resistance genes and DNA originating from other
    organisms, can be avoided, and
(5) a rare-cutting endonuclease, e.g. I-SceI, is used to
    introduce a double-strand break in the chromosomal
    DNA of the mould.



Although the emphasis is given to the use of I-SceI as a
rare-cutting endonuclease, it is envisaged that also other
rare-cutting endonucleases can be used, including HO
endonuclease and VDE, the latter also being known as PI-SceI.

Brief description of the drawings

Figure 1. shows the construction of the plasmids pUR5710,
pUR5711 and pUR5712, in which
amp = ampicillin resistance gene, and
pyrG = pyrG gene from A. awamori. 'pyrG or pyrG' indicates
that the gene is truncated at the 5' or 3' end,
respectively.



Figure 2. shows the construction of the plasmids pUR5713
(Figure 2A) and pUR5714 (Figure 2B), in which
pBS-SK = pBluescript®-SK.

Figure 3. shows the construction of the plasmids pUR5716
and pUR5718, in which cos = cos site.

Figure 4. shows the construction of the plasmid pUR5729, in
which
PexlA = Promoter sequences of the A. awamori
1,4-β-endoxylanase A gene,
cut = coding region of the F. solani pisi cutinase gene
(synthetic copy of cDNA), and
TexlA = Terminator sequences of the A. awamori
1,4-β-endoxylanase A gene.

Figure 5. shows the construction of the cosmids pUR5722 and
pUR5725.

Figure 6. shows the construction of the plasmids pUR5736
and pUR5737, in which
Pgpd = Promoter sequences of the A. nidulans gpd gene,
hph = coding region of the hygromycin phosphotransferase
gene from E. coli, and
TtrpC = Terminator sequences from the A. nidulans trpC
gene.

Figure 7. shows the construction of plasmid pUR5724, in
which I-SceI = synthetic gene encoding the Saccharomyces
cerevisiae I-SceI endonuclease.

Figure 8. Experimental design of the process for
site-directed integration of multiple copies of a gene in the
mould A. awamori using the I-SceI endonuclease. The
wild-type pyrG gene is depicted in the upper part of the figure.
The coding region of the gene is indicated by the light
grey shaded box. Below this, the target locus containing
the I-SceI restriction site as present in the A. awamori
strain AWCSCE is shown. Between the non-functional 5' part of
the pyrG gene and 3' flanking sequences of the chromosomal
pyrG locus an I-SceI site is present. The fragment that is
introduced into the strain AWCSCE contains a non-functional
3' part of the pyrG gene that is partially homologous to
the mutated pyrG gene at the chromosomal target locus, one
or multiple gene copies (indicated by the dark grey shaded
boxes 1, 2 and n) comprising at least one structural gene
encoding at least one desired protein, and an additional
sequence from the pyrG locus that is homologous to
sequences present immediately downstream of the I-SceI site
at the target locus. Simultaneously, the I-SceI
endonuclease or an expression plasmid containing the I-SceI gene
is co-introduced into the cell. After homologous
recombination induced by a double-strand break at the I-SceI site
an intact pyrG gene is restored and the multiple gene copies
are simultaneously integrated at the pyrG locus, which is
illustrated in the lower part of the figure.






Figure 9. shows the autoradiograph of the Southern blot of
A. awamori genomic DNA probed with an 18 bp end-labelled
oligonucleotide representing the I-SceI restriction site.
The genomic DNA was digested with Sau3A. M represents the 1
kb DNA marker (BRL). Lanes 1, 2 and 3 contain samples of
the plasmid pSCM522 digested with HinfI (control DNA
substrate containing the I-SceI restriction site, supplied
with the I-SceI endonuclease from Boehringer Mannheim, cat.
no. 1497235) in concentrations that correspond to 200, 20
and a single copy of the I-SceI restriction site(s) in the
genome of A. awamori, respectively. Lanes 4 and 5 contain
Sau3A digested genomic A. awamori DNA (7.5 μg). Lanes 6, 7
and 8 contain samples of plasmid pUR5712 in concentrations
that correspond to a single copy, 20 and 200 copies of the
I-SceI restriction site(s) in the genome of A. awamori,
respectively.

Figure 10. shows the autoradiograph of the Southern blot of
the two A. awamori mutant pyrG strains. The genomic DNA was
digested with BglII and I-SceI and probed with a 2.4 kb
BamHI x HindIII fragment from plasmid pAW4.1 containing the
A. awamori pyrG gene. M represents the 1 kb DNA marker
(BRL). Lanes 1 and 2 contain genomic DNA of the A. awamori
mutant pyrG strains. Lane 3 contains genomic DNA from the
non-transformed wild-type A. awamori strain.

Figure 11. Southern analysis of the recombinant strains
obtained in the transformation of A. awamori strain AWCSCE
(see Example 4). The genomic DNA was digested with SalI or
BglII and probed with a number of different probes (see
Example 5). The two probes that are used in the Southern
blots shown in Figure 12 are indicated:
pyrG = a 2.4 kb BamHI x HindIII fragment from pAW4.1
containing the A. awamori pyrG gene.
TexlA = 0.5 kb AflII x SacI fragment from pUR5729
containing the exlA terminator.

The wild-type pyrG gene is depicted in the upper part of
the figure. The SalI digestion will give a 3.3 kb and a 3.8
kb fragment, which fragments will hybridize with the pyrG
probe. The BglII digestion will give a 9.0 kb and a 2.7 kb
fragment, which fragments do not hybridize with the TexlA
probe.

Below this, the target locus containing the I-SceI
restriction site as present in the A. awamori strain AWCSCE
is shown. The SalI digestion will give a 3.3 kb and a 3.0
kb fragment, of which only the 3.3 kb fragment will
hybridize with the pyrG probe. The BglII digestion will
give a 9.0 kb and a 2.7 kb fragment, which fragments do not
hybridize with the TexlA probe.

The lower part of the figure shows a restored pyrG gene
containing no copies or multiple copies of the cutinase
gene. The SalI digestion will give a 3.3 kb fragment and, in
recombinants obtained with pUR5718, a 3.7 kb fragment, in
recombinants obtained with pUR5722 a 9.6 kb fragment and in
recombinants obtained with pUR5725 a 17.1 kb fragment. All
these fragments will hybridize with the pyrG probe. The
BglII digestion will give a 9.0 kb fragment and, in
recombinants obtained with pUR5718, a 2.6 kb fragment, in
recombinants obtained with pUR5722 or pUR5725 a 1.7 kb or a
1.9 kb fragment (depending on the orientation of the cutinase
gene compared to the pyrG gene) and a 1.5 kb fragment,
which is derived from the tandem repeat of the cutinase
gene. Only the fragments containing the exlA terminator
present in the cutinase expression cassette will hybridize
with the TexlA probe.

The bottom part of the figure shows the position of the pyrG
and TexlA probes.
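
The SalI fragment sizes listed above for the pyrG probe can be collected into a small lookup table for interpreting a blot. The following is only an illustrative sketch: the dictionary restates the sizes given in this figure legend, while the function name, the genotype labels and the tolerance parameter are hypothetical.

```python
# Expected SalI fragments (kb) that hybridize with the pyrG probe,
# as stated in the legend of Figure 11.
EXPECTED_SALI_PYRG_KB = {
    "wild type": (3.3, 3.8),
    "AWCSCE (target strain)": (3.3,),  # the 3.0 kb fragment does not hybridize
    "pUR5718 recombinant": (3.3, 3.7),
    "pUR5722 recombinant": (3.3, 9.6),
    "pUR5725 recombinant": (3.3, 17.1),
}


def matching_genotypes(observed_kb, tolerance_kb=0.05):
    """Return every genotype whose expected band pattern matches the
    observed band sizes within +/- tolerance_kb (an assumed gel resolution)."""
    observed = sorted(observed_kb)
    matches = []
    for genotype, expected in EXPECTED_SALI_PYRG_KB.items():
        expected = sorted(expected)
        if len(expected) == len(observed) and all(
            abs(o - e) <= tolerance_kb for o, e in zip(observed, expected)
        ):
            matches.append(genotype)
    return matches


if __name__ == "__main__":
    print(matching_genotypes([9.6, 3.3]))
    # With a coarser tolerance the 3.7 kb and 3.8 kb bands become ambiguous.
    print(matching_genotypes([3.75, 3.3], tolerance_kb=0.1))
```

Because the wild-type 3.8 kb band and the pUR5718 3.7 kb band differ by only 0.1 kb, a SalI/pyrG pattern alone may not separate these two cases, which is consistent with the use of a second digest and probe (BglII, TexlA) in the analysis.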

Figure 12. shows the autoradiograph of the Southern blot
analysis of the recombination events. M represents the 1 kb
DNA marker (BRL).

Lane 1: recombinant AWC-pUR5718#S1;
Lane 2: recombinant AWC-pUR5718#S2;
Lane 3: recombinant AWC-pUR5718#1;
Lane 4: non-transformed wild-type A. awamori strain;
Lane 5: A. awamori mutant pyrG strain AWCSCE;
Lane 6: recombinant AWC-pUR5725#1;
Lane 7: recombinant AWC-pUR5725#2;
Lane 8: recombinant AWC-pUR5722#A1;
Lane 9: recombinant AWC-pUR5722#A2;
Lane 10: recombinant AWC-pUR5722#B1;
Lane 11: recombinant AWC-pUR5722#B2;
Lane 12: recombinant AWC-pUR5722#B3.

A. The genomic DNA is digested with SalI and probed with
the pyrG probe (see Figure 11).
B. The genomic DNA is digested with BglII and probed with
the TexlA probe.

Detailed description of the invention

The invention provides a process for site-directed
integration of multiple copies of a gene in a mould, which
comprises

(i) providing a mould cell containing in its chromosomal
    DNA a restriction site for a rare-cutting
    endonuclease,
(ii) transforming such mould cell with a piece of DNA
    comprising in the 5' to 3' direction in the following
    order
    (a) a first DNA fragment homologous to part of the
        DNA upstream and in the neighbourhood of the
        restriction site for the rare-cutting endonuclease
        present in the chromosomal DNA of the mould,
    (b) multiple copies of at least one expressible gene
        comprising a structural gene encoding a desired
        protein,
    (c) a second DNA fragment homologous to part of the
        DNA downstream and in the neighbourhood of the
        restriction site for the rare-cutting
        endonuclease present in the chromosomal DNA of the
        mould,
    while during the transformation of the mould the
    presence of the rare-cutting endonuclease is
    provided,
(iii) selecting or screening for a mould cell in which the
    multiple gene copies of said expressible gene are inserted
    into the chromosomal DNA of the mould.
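
The order of elements in the transforming DNA, steps (ii)(a) to (ii)(c), and the intended outcome of step (iii) can be pictured with a toy string model. This is a minimal sketch and not a description of the molecular mechanism: the function names and the short placeholder sequences in the usage example are hypothetical; only the 18 bp I-SceI recognition sequence is taken from this specification.

```python
I_SCEI_SITE = "TAGGGATAACAGGGTAAT"  # 18 bp recognition sequence (SEQ ID NO: 1)


def build_repair_dna(upstream_homology, expressible_gene, copies, downstream_homology):
    """Assemble, 5' to 3': (a) upstream homology, (b) `copies` tandem copies
    of the expressible gene, (c) downstream homology."""
    if copies < 2:
        raise ValueError("multicopy integration requires at least two copies")
    return upstream_homology + expressible_gene * copies + downstream_homology


def repair_break(chromosome, repair_dna, upstream_homology, downstream_homology):
    """Model repair of a break at the I-SceI site: the region spanned by the
    two homology arms on the chromosome is replaced by the repair DNA."""
    if I_SCEI_SITE not in chromosome:
        raise ValueError("chromosome has no I-SceI site; step (i) is not satisfied")
    up_start = chromosome.index(upstream_homology)
    up_end = up_start + len(upstream_homology)
    down_start = chromosome.index(downstream_homology, up_end)
    down_end = down_start + len(downstream_homology)
    return chromosome[:up_start] + repair_dna + chromosome[down_end:]


if __name__ == "__main__":
    up, down, gene = "AAAACCCC", "GGGGTTTT", "ATGGCTTAA"  # placeholder sequences
    chromosome = "CG" + up + I_SCEI_SITE + down + "CG"
    repaired = repair_break(chromosome, build_repair_dna(up, gene, 3, down), up, down)
    print(repaired.count(gene), I_SCEI_SITE in repaired)
```

In this model the recognition site disappears from the repaired locus, so the same locus cannot be cut again once the copies have been integrated.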

For site-directed integration it is desirable to use a
restriction endonuclease forming a double-strand break at
the target site that does not form breaks at other loci in
the chromosome, thus a rare-cutting endonuclease, an
example of which is the I-SceI endonuclease from
Saccharomyces cerevisiae. The nucleotide sequence encoding
this enzyme and some uses of that sequence are described in
International PCT patent application WO 96/14408; Institut
Pasteur, published 17 May 1996. But also other known
rare-cutting endonucleases can be used in a process according to
the invention, e.g. VDE, also known as PI-SceI (see M.
Jasin; Trends In Genetics (TIG) 12 (No. 6, June 1996)
224-228; Genetic manipulation of genomes with rare-cutting
endonucleases, and M. Brenneman et al.; Proc. Natl. Acad.
Sci. USA 93 (1996) 3608-3612; Stimulation of
intrachromosomal homologous recombination in human cells by
electroporation with site-specific endonucleases) and HO
endonuclease (see M. Chiurazzi; The Plant Cell 8 (Nov.
1996) 2057-2066; Enhancement of Somatic Intrachromosomal
Homologous Recombination in Arabidopsis by the HO
Endonuclease). Moreover, if a specific mould genome is
practically free from restriction sites for a more familiar
restriction endonuclease, such endonuclease can be used as
well and can be considered a rare-cutting endonuclease for
that specific mould genome.
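
Why an 18 bp recognition site qualifies as "rare-cutting" can be illustrated with a back-of-the-envelope estimate. The sketch below assumes a random genome with equal base frequencies, which real genomes only approximate; the function name is hypothetical, and the 35 Mb genome size in the usage example is an assumed order of magnitude for a filamentous fungus, not a figure from this specification.

```python
def expected_site_count(genome_bp: int, site_length: int, palindromic: bool = False) -> float:
    """Expected number of occurrences of one specific recognition sequence
    in a random genome of `genome_bp` base pairs with equal base frequencies.

    A non-palindromic site can occur on either strand, so both are counted;
    a palindromic site reads the same on both strands and is counted once.
    """
    if genome_bp <= 0 or site_length <= 0:
        raise ValueError("genome_bp and site_length must be positive")
    strands = 1 if palindromic else 2
    return strands * genome_bp / 4 ** site_length


if __name__ == "__main__":
    assumed_genome_bp = 35_000_000  # assumption for illustration only
    print(expected_site_count(assumed_genome_bp, 18))                   # 18 bp site such as I-SceI
    print(expected_site_count(assumed_genome_bp, 6, palindromic=True))  # typical 6 bp palindromic site
```

Under these assumptions a 6 bp site is expected thousands of times per genome, whereas an 18 bp site is expected far less than once, so a single introduced site is likely to be the only target.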

The structural gene encoding the desired protein which gene 
forms part of the expressible gene can be homologous or 
heterologous to the mould. 

In some cases the restriction site for the rare-cutting
endonuclease occurs naturally in the chromosomal DNA of the
mould. But if the mould does not contain a restriction site
for such rare-cutting endonuclease, the restriction site
for the rare-cutting endonuclease can be introduced at a
desired locus.

The selection of the transformed mould can be carried out
by using a selectable marker. Preferentially, such
selectable marker is a characteristic of a
naturally-occurring, wild-type mould strain, while the mould strain
to be transformed is a mutant strain thereof, deficient in
said selectable marker, e.g. the orotidine-5'-phosphate
decarboxylase gene (pyrG gene) which is present in
wild-type Aspergillus awamori. But also other loci containing
auxotrophic markers including trpC, argB, and niaD genes
can be used, whereas other possible selectable markers
include genes producing an easily assayable product.
Sometimes the DNA introduced into the mould can be used as
the selectable marker. For example, when the introduced DNA
is expressed, it can result in a product not produced in
the non-transformed mould, but which is more or less easily
assayable. Or the presence or absence of the DNA can be
determined by applying PCR techniques.

Preferably, the desired locus is within a selectable marker
gene or in the neighbourhood thereof. In order to
complement any disrupted or (partially) deleted gene, the piece
of DNA used for transforming the mould cell can comprise a
third DNA fragment that completes any disrupted or
(partially) deleted selectable marker gene in the
chromosomal DNA. This would allow direct selection of
strains containing the desired targeted integration of
multiple copies of the gene.

The part of the DNA upstream of the restriction site for
the rare-cutting endonuclease present in the chromosomal
DNA of the mould, to which the first DNA fragment is
homologous, can be part of a selectable marker gene.
Alternatively, the part of the DNA downstream of the
restriction site for the rare-cutting endonuclease present
in the chromosomal DNA of the mould, to which the second
DNA fragment is homologous, is part of a selectable marker
gene. Or both the upstream and downstream parts can be
part of the same selectable marker gene.

If two or more restriction sites for the rare-cutting 
endonuclease are present, the number of integrated gene 
copies can be increased or several different genes can be 
introduced at different loci, or both. 

Preferably, the expressible gene comprises (1) a promoter 
operable in said mould, (2) optionally a DNA fragment 
encoding a secretion signal peptide facilitating the 
secretion of said desired protein from said mould, (3) a 
structural gene encoding said desired protein, and (4) 
optionally a terminator operable in said mould, whereby the 
promoter and the optional terminator control the expression 
of the structural gene. More preferably, the promoter, 
secretion signal and terminator are homologous to the mould 
to be transformed. In that case the amount of foreign DNA 
is kept to a minimum. 

During the transformation of the mould the rare-cutting
endonuclease can be provided by adding the endonuclease as
such in a way that is similar to Restriction Enzyme Mediated
Integration (REMI; Kuspa and Loomis, Proc. Natl. Acad. Sci.
USA 89 (1992) 8803-8807; Redman and Rodriguez, Exp. Mycol.
18 (1994) 230-246). This is preferred when the amount of
foreign DNA introduced into the mould should be as low as
possible.

WO 99/32641    PCT/EP98/06519

But for other reasons it can be convenient to form the
rare-cutting endonuclease in situ by co-transforming the
mould with DNA encoding the rare-cutting endonuclease,
which DNA can be expressed during or after the
transformation of the mould. Preferably, this DNA forms
part of a plasmid that does not integrate in the genome, so
that after further culturing the transformed mould strain
can lose the plasmid while the desired DNA is still
maintained in the genome. This event can be checked by
further screening to confirm the absence of the
rare-cutting endonuclease-encoding DNA in the genome of the
recipient strain.






Preferably the mould belongs to the fungal division of
Eumycota, more preferably to one of the fungal
sub-divisions Ascomycotina, Basidiomycotina, Deuteromycotina,
Mastigomycotina, and Zygomycotina. It is especially
preferred that the mould is selected from the genus
Aspergillus, more particularly belonging to the species
Aspergillus awamori. The invention also provides a
transformed mould obtainable by a process according to the
invention for the site-directed integration of multiple
copies of a gene in a mould. Once a transformed mould
according to the invention has been obtained, such
transformed mould can be used in a process for further
culturing.

The invention also provides a process for producing and 
optionally secreting a desired protein by culturing a 
transformed mould obtainable with a process as described 
above under conditions whereby the structural gene encoding 
said desired protein is expressed, and optionally isolating 
or concentrating the desired protein. 






One way of introducing multiple copies of a gene is
introducing several copies of the complete expression
cassette as described below under the heading Construction
of multi-copy vectors.

An alternative is introducing several copies of the
structural gene (polycistronic). After production, the
encoded polypeptide has to be cleaved to form the single
peptides, e.g. by using the enzyme KEX2.

The invention is exemplified by the following Example
preceded by a description of the Materials and Methods that
were used. In this Example the following is described.

Example 1A. Experimental design of the process for
site-directed integration of multiple copies of a gene in the
mould A. awamori using the I-SceI endonuclease.

Example 1B. Determination of the occurrence of a natural
I-SceI restriction site in the genome of A. awamori.

Example 1C. Construction of the A. awamori mutant pyrG
strain AWCSCE which contains an I-SceI restriction site at
the locus of the mutated pyrG gene.

Example 1D. Induction of site-directed integration at the
pyrG locus by I-SceI expression.

Example 1E. Southern blot analysis of recombination events.

MATERIALS AND METHODS 


Bacterial and mould strains 

For standard bacterial cloning the Escherichia coli strain
DH5α (genotype: F-, endA1, hsdR17 (rk- mk+), supE44, thi-1,
lambda-, recA1, gyrA96, relA1, Δ(argF-lacIZYA)U169, deoR
(phi80d(lacZ)ΔM15); Hanahan; J. Mol. Biol. 166 (1983)
557-580) was used. For cloning multiple copies of a gene in a
cosmid vector via packaging the Escherichia coli strain
1046 (Cami, B. and Kourilsky, P.; Nucleic Acids Research 5
(1978) 2381) was used.


The mould strain Aspergillus awamori #40 (a derivative of
A. awamori CBS 115.52 also mentioned in WO 93/12237, page 9
line 13) was used to construct a pyrG derivative strain,
designated AWCSCE, containing an I-SceI restriction site at
the pyrG locus.

The preparation of A. awamori #40 (also known as A. niger
var. awamori #40) was described in WO 91/19782 on page 13,
lines 29-39, which read:

The production level of the A. niger var. awamori transformants,
however, can be further increased by using suitable A. niger var.
awamori mutant strains, such as A. niger var. awamori #40,
which produces clearly more xylanase than the wild type strain.
The mutant A. niger var. awamori #40 has been obtained by mutagenesis
of A. niger var. awamori spores and selection for xylanase production. In
bran medium the "xylA" A. niger var. awamori #40 transformant
produced 190 000 U xylanase, which is a considerable increase over the
best producing A. niger var. awamori transformant.

In this specification the following endonuclease
restriction sites are used:

giving staggered ends         giving blunt ends

AflII     C↓TTAAG             SmaI    CCC↓GGG
BamHI     G↓GATCC             ScaI    AGT↓ACT
BglII     A↓GATCT
EcoRI     G↓AATTC
HindIII   A↓AGCTT
HinfI     G↓ANTC
NdeI      CA↓TATG
NotI      GC↓GGCCGC
PstI      CTGCA↓G
SacI      GAGCT↓C
SalI      G↓TCGAC
Sau3AI    ↓GATC

and the recognition site (18 bp) of the rare-cutting
restriction endonuclease I-SceI from Saccharomyces cerevisiae:

5'-TAGGGATAACAGGGTAAT-3' (see SEQ ID NO: 1)
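
Where a sequence of the locus of interest is available, the presence of this 18 bp site can also be checked in silico. The sketch below is a hypothetical helper, not a method used in this specification (which relies on Southern blotting); because the site is not palindromic, it searches both the given strand and its reverse complement.

```python
I_SCEI_SITE = "TAGGGATAACAGGGTAAT"  # SEQ ID NO: 1
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(sequence: str, site: str = I_SCEI_SITE):
    """Return sorted (0-based position, strand) pairs for every occurrence
    of `site` on the given strand ('+') or the opposite strand ('-')."""
    sequence = sequence.upper()
    patterns = {"+": site}
    if reverse_complement(site) != site:  # avoid double-counting palindromes
        patterns["-"] = reverse_complement(site)
    hits = []
    for strand, pattern in patterns.items():
        start = sequence.find(pattern)
        while start != -1:
            hits.append((start, strand))
            start = sequence.find(pattern, start + 1)
    return sorted(hits)


if __name__ == "__main__":
    demo = "CCGG" + I_SCEI_SITE + "AATT" + reverse_complement(I_SCEI_SITE) + "GC"
    print(find_sites(demo))
```

Positions are reported for the start of the match on the given strand, so a '-' hit marks where the reverse-complemented site begins in the input sequence.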
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Plasmid constructions 

Standard recombinant DNA techniques were used for cloning
(Sambrook et al.; Molecular cloning - A laboratory manual.
Cold Spring Harbor Laboratory, Cold Spring Harbor, New York
(1989)). In all cloning steps involving synthetic DNA
linkers or PCR fragments, the correct DNA sequence of the
linkers or PCR fragments was verified by DNA sequence
analysis, using a Pharmacia LKB ALF fluorescent sequencer.



Construction of the target site: 

The plasmid pUR5710 (see Figure 1) was constructed by
cloning a 2.0 kb BamHI/SalI fragment containing a 5' part
of the pyrG gene, which is present on the plasmid pAW4.1
(Gouka et al.; Curr. Genet. 27 (1995) 536-540), into the
general cloning vector pIC20R (Marsh et al.; Gene 32 (1984)
481-485) digested with BamHI and SalI. Subsequently, a
synthetic DNA linker containing the 18 bp recognition site
for the I-SceI endonuclease
(5'-TAGGGATAACAGGGTAAT-3'; see SEQ ID NO: 1) flanked by
SalI and HindIII sites was cloned into the plasmid pUR5710
digested with SalI and HindIII. This resulted in the
plasmid pUR5711 (see Figure 1). The plasmid pUR5712 (see
Figure 1) was constructed by cloning a 2.0 kb HindIII
fragment containing sequences downstream of the pyrG coding
region, which is present on the plasmid pAW4.4 (Gouka et
al.; Curr. Genet. 27 (1995) 536-540), into the plasmid
pUR5711 digested with HindIII. The orientation of this
HindIII fragment compared to the coding region of the pyrG
gene is identical to the wild-type situation. The plasmid
pUR5712 was used to construct the A. awamori mutant pyrG-
strain AWCSCE.



Construction of the repair construct: 

For the construction of the plasmid pUR5713 (see Figure 2A)
plasmid pAW4.4 was digested with HindIII, the HindIII site
was filled in with Klenow and the fragment was subsequently
digested with BamHI. The resulting 1.6 kb fragment,
containing sequences downstream of the pyrG coding region,
was isolated. Furthermore, the plasmid pAW4.20 (Gouka et
al.; Curr. Genet. 27 (1995) 536-540) was digested with
BamHI and HindIII and the 0.4 kb fragment, containing
sequences present immediately upstream of the 1.6 kb
fragment described above, was isolated. The
0.4 kb HindIII/BamHI and 1.6 kb BamHI/filled-in HindIII
fragments were simultaneously cloned into the general
cloning vector pBluescript® SK (Stratagene) digested with
HindIII and SmaI. This resulted in the plasmid pUR5713. The
plasmid pUR5714 (see Figure 2B) was constructed by cloning
a 1.0 kb BglII/HindIII fragment containing a 3' part of the
pyrG gene, which is present on the vector pAW4.1, into the
general cloning vector pBluescript® SK digested with BamHI
and HindIII. The cosmid pUR5716 (see Figure 3) is derived
from the cosmid vector pJB8 (Ish-Horowicz, D. and
Burke, J. F.; Nucleic Acids Res. 9 (1981) 2989) by replacing
the EcoRI/HindIII polylinker fragment by a synthetic linker
containing an EcoRI and a NotI restriction site having the
following sequence:

5'-AATTCATGCGGCCGCT-3'
3'-    GTACGCCGGCGATCGA-5'   (see SEQ ID NO: 2).

In this cloning step, the HindIII site is lost. The cosmid
pUR5718 (see Figure 3) was constructed by simultaneously
cloning the 1.0 kb NotI/HindIII fragment from the plasmid
pUR5714 and the 2.0 kb HindIII/NotI fragment from the
plasmid pUR5713 into the cosmid pUR5716 digested with
NotI. Thereby, this vector carries sequences homologous to
both sides of the I-SceI site at the pyrG target locus in
the A. awamori mutant pyrG- strain AWCSCE.
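
The loss of the HindIII site can be checked directly from the linker sequence. The following is a minimal sketch, not part of the original method: it assumes that the EcoRI-cut vector end contributes a G immediately 5' of the AATT overhang and that the HindIII-cut vector end contributes a T immediately 3' of the AGCT overhang (as follows from the cut positions G↓AATTC and A↓AGCTT). The function names are hypothetical.

```python
# Top strand of the synthetic linker as listed in SEQ ID NO: 2,
# including both single-stranded overhangs (5'-AATT... and ...AGCT-3').
LINKER = "AATTCATGCGGCCGCTAGCT"

# Recognition sequences taken from the restriction-site table of the text.
SITES = {"EcoRI": "GAATTC", "NotI": "GCGGCCGC", "HindIII": "AAGCTT"}


def junction_after_ligation(linker: str) -> str:
    """Return the top-strand sequence across both ligation junctions.

    Assumption (not stated in the text): the EcoRI-cut vector end supplies
    a 'G' 5' of the linker and the HindIII-cut vector end supplies a 'T'
    3' of the linker.
    """
    return "G" + linker + "T"


def sites_present(seq: str) -> dict:
    """Report which recognition sequences occur in seq."""
    return {name: site in seq for name, site in SITES.items()}


if __name__ == "__main__":
    print(sites_present(junction_after_ligation(LINKER)))
```

Under these assumptions the EcoRI and NotI sites are present after ligation, while the junction at the former HindIII end reads TAGCTT instead of AAGCTT, so HindIII no longer cuts there.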




Construction of multi-copy vectors: 

The plasmid pUR5729 (see Figure 4) was constructed by
cloning the 1.5 kb PstI/SacI fragment containing the open
reading frame (ORF) of the cutinase gene from Fusarium
solani pisi (synthetic copy of the cDNA; Van Gemeren et
al.; Journal of Biotechnology 40 (1995) 155-162) under
control of the promoter and terminator of the exlA gene
from Aspergillus awamori (Gouka et al.; Applied
Microbiology and Biotechnology 46 (1996) 28-35), from the
plasmid pUR7385 (Van Gemeren et al.; Applied Microbiology
and Biotechnology 45 (1996) 755-763), into the general
cloning vector pIC19H (Marsh et al.; Gene 32 (1984) 481-485)
digested with PstI and SacI. Based on the cosmid
pUR5718 two new cosmids were constructed containing
multiple copies of the cutinase gene under control of the
exlA expression signals (as described above). A single copy
of this expression cassette was isolated as a 1.5 kb
HindIII fragment from the plasmid pUR5729 and ligated into
the cosmid pUR5718 digested with HindIII. After
transforming the ligation mixture into the E. coli strain
DH5α, the cosmid pUR5722 (see Figure 5) was obtained, which
contained a tandem array of four copies of the expression
cassette. After packaging of the ligation mix using the
λ-DNA in vitro packaging module (Amersham; code RPN1717), the
packaging mix was transformed into E. coli strain 1046
(both according to the protocol provided with the module).
From this transformation the cosmid pUR5725 (see Figure 5)
was obtained, which contained a tandem array of nine copies
of the expression cassette.

Construction of the I-SceI expression vector:

The plasmid pUR5736 (see Figure 6) was constructed by
replacing a ScaI/BamHI fragment containing a part of the
promoter from the A. nidulans gpd gene fused to the coding
region of the E. coli hph gene, which is present on the
plasmid pAN7.1 (Punt et al.; Gene 56 (1987) 117-124), by a
PCR fragment containing the same promoter fragment fused to
a NdeI and BamHI restriction site, digested with ScaI and
BamHI. To obtain the vector fragment from the plasmid
pAN7.1 the plasmid was only partially digested with ScaI,
because the pUC backbone of the plasmid contains another
ScaI site. The PCR fragment was obtained in a PCR reaction
on the plasmid pAN7.1 using the primers MGgpd1
(5'-GACAAGGTCGTTGCGTCAGTC-3'; see SEQ ID NO: 3) and MGgpd2
(5'-CGGGATCCTTCCATATGTGATGTCTGCTCAAGCGG-3'; see SEQ ID NO:
4). Subsequently, a BglII/HindIII fragment from the plasmid
pUR5736 containing the promoter from the A. nidulans gpd
gene and the terminator sequences from the A. nidulans trpC
gene, was cloned into the BamHI/HindIII sites of the
general cloning vector pBluescript® SK. This resulted in the
plasmid pUR5737. Hereafter, pUR5737 was digested with
BamHI, this site was filled in using the Klenow enzyme, and
a second digestion with NdeI was performed. The plasmid
pSCM525 (kindly provided by Prof. Dr. B. Dujon, Institut
Pasteur, Paris and described in United States Patent no.
5,474,896; filed Nov. 5, 1992), containing a synthetic gene
encoding the I-SceI endonuclease, was digested with SalI,
this site was filled in using the Klenow enzyme, and a
second digestion with NdeI was performed. The resulting
fragment was cloned into the plasmid pUR5737 (NdeI/filled-in
BamHI fragment as described above), which resulted in the
I-SceI expression vector pUR5724 (see Figure 7).




Transformation experiments 
Preparation of protoplasts: 

Conidia were obtained by growing the A. awamori strains at 
30°C on a nitrocellulose filter (Hybond-N, Amersham) placed 
on a PDA (Potato Dextrose Agar) plate for several days and 
subsequently washing the filters with physiological salt
solution.

Protoplasts of A. awamori were prepared as described by
Punt and Van den Hondel (Methods in Enzymology 216 (1993)
447-457). A shake flask containing 200 ml of MM medium (0.4
ml 1 M MgSO4, 2 ml 100 x spore elements (per litre: 60 g
EDTA.2H2O, 11 g CaCl2.2H2O, 7.5 g FeSO4.7H2O, 2.8 g MnSO4.H2O,
2.7 g ZnSO4.7H2O, 0.8 g CuSO4.5H2O, 0.9 g CoCl2.6H2O, 0.5 g
Na2MoO4.2H2O, 0.8 g H3BO3, 0.5 g KI, pH 4.0 with NaOH), 10 ml
20% glucose, 4 ml 50 x AspA (3.5 M NaNO3, 0.35 M KCl, 0.55 M
KH2PO4, pH 6.5 with KOH)) including 0.5% yeast extract was
inoculated with 10* conidia/ml of A. awamori and incubated
for 18 hours at 30°C in a shaker at 200 rpm. Mycelium was
harvested through sterile Miracloth® and washed with
ice-cold 0.6 M MgSO4. The mycelium was resuspended in OM medium
(per litre: 500 ml 2.4 M MgSO4, 480 ml H2O, 16.8 ml 0.5 M
Na2HPO4, 3.2 ml 0.5 M NaH2PO4, pH 5.8-5.9) at 5 ml/g
mycelium. Subsequently, 5 mg Novozym 234® and 6 mg BSA were
added per g mycelium. Protoplasting was allowed to proceed
for 1-2 hours at 30°C in a shaker at 80-100 rpm. The
formation of protoplasts was checked using a light
microscope. Protoplasts were filtered through sterile
Miracloth® and the sample was divided in 30 ml aliquots in
Falcon tubes. STC (1.2 M sorbitol, 10 mM Tris/HCl pH 7.5,
50 mM CaCl2.2H2O) was added to bring the volume up to 50 ml
and the protoplasts were harvested by centrifugation at
2000 rpm for 10 minutes at 4°C. The protoplasts were washed
again in 50 ml STC and resuspended in STC at a
concentration of approximately 10^ protoplasts/ml.
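
The OM medium recipe is given as volumes of stock solutions. As an illustrative sketch only (the helper names are hypothetical), the final concentrations per litre follow from simple dilution, C_final = C_stock x V_stock / V_total:

```python
# Volumes (ml) per litre of OM medium, as listed in the recipe above.
OM_VOLUMES_ML = {
    "MgSO4": 500.0,    # 2.4 M stock
    "H2O": 480.0,
    "Na2HPO4": 16.8,   # 0.5 M stock
    "NaH2PO4": 3.2,    # 0.5 M stock
}

# Stock molarities (mol/l) for the solutes.
STOCK_MOLAR = {"MgSO4": 2.4, "Na2HPO4": 0.5, "NaH2PO4": 0.5}


def final_molarity(stock_molar: float, stock_ml: float, total_ml: float) -> float:
    """Concentration after dilution: C_stock * V_stock / V_total."""
    return stock_molar * stock_ml / total_ml


def om_medium_summary() -> dict:
    """Total volume, MgSO4 molarity and total phosphate (mM) of OM medium."""
    total_ml = sum(OM_VOLUMES_ML.values())
    mgso4 = final_molarity(STOCK_MOLAR["MgSO4"], OM_VOLUMES_ML["MgSO4"], total_ml)
    phosphate = sum(
        final_molarity(STOCK_MOLAR[salt], OM_VOLUMES_ML[salt], total_ml)
        for salt in ("Na2HPO4", "NaH2PO4")
    )
    return {
        "total_ml": total_ml,
        "MgSO4_M": mgso4,
        "phosphate_mM": phosphate * 1000.0,
    }


if __name__ == "__main__":
    print(om_medium_summary())
```

The listed volumes add up to one litre and correspond to about 1.2 M MgSO4 in about 10 mM sodium phosphate buffer.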

PEG transformations: Five to 7.5 µg of a single plasmid or
two plasmids (in case the I-SceI expression plasmid is
co-transformed) was added to an aliquot of 100 µl (10')
protoplasts, mixed and incubated for 25 minutes on ice. PEG
was added in two 200 µl aliquots and an 850 µl aliquot, and
the mixture was incubated at room temperature for 20
minutes. Finally, the mixture was washed with 10 ml of STC,
harvested by centrifugation at 2000 rpm for 10 minutes at
room temperature and the sample was plated on a MM plate
for selection of transformants.

Construction of the A. awamori mutant pyrG- strain AWCSCE:

Transformation of the wild-type A. awamori strain was
performed with a purified (Qiaex gel extraction kit; Qiagen
cat. no. 20021) EcoRI fragment obtained from the plasmid
pUR5712 containing the mutant pyrG gene with the I-SceI
restriction site at the site of the deletion (see Figures 1
and 8). Per transformation 2 x 10* protoplasts were
transformed with 10 µg of DNA. Since pyrG- strains are
resistant to 5-FOA (5-fluoro-orotic acid; Boeke et al.; Mol.
Gen. Genet. 197 (1984) 345-346), pyrG- transformants can be
selected directly from wild-type strains. Transformants
were selected on MM plates (AspA is replaced by AspA-N; 50
x AspA-N = 0.35 M KCl, 0.55 M KH2PO4, pH 6.5 with KOH)
supplemented with 10 mM uridine and 0.75 mg/ml of 5-FOA,
with 10 mM proline as the N-source. The mutant phenotype of
the transformants that were obtained was checked by growing
these colonies on MM plates without uridine. Two
transformants that were not able to grow without uridine
were further analyzed by Southern blot analysis (Figure 9
and see below).

DNA isolation, PCR and Southern analysis

Southern analysis was performed to confirm at a molecular
level that the mould cell had been transformed and the
desired DNA had been integrated into the genome.
To obtain mycelium material for a genomic DNA isolation
approximately 10" mould conidia were inoculated in 50 ml of
Aspergillus minimal medium supplemented with 0.5% yeast
extract and incubated for a period ranging from 22 hours to
3 days at 30°C in a shaker at 200 rpm. The mycelium was
harvested through Miracloth® (Calbiochem) and snap frozen in
liquid N2. Frozen samples were ground to a fine powder using
a Mikro-Dismembrator® (ex Braun Biotech International) for 1
minute at 1750 rpm. Mould genomic DNA was isolated using
Qiagen genomic tips (cat. no. 10223) and a protocol for
genomic DNA purification from filamentous fungi provided by
the supplier. The step for digestion of cell wall material
was omitted.

The PCR reactions were performed in a Perkin Elmer DNA
Thermal Cycler 480 using approximately 1 µg genomic DNA, 25
pmol of each primer, 10 nmol of each dNTP, 1 unit of Taq DNA
polymerase (Gibco-BRL) and 10 µl of 10 x Taq DNA polymerase
buffer in a total volume of 100 µl. The reactions were
overlaid with mineral oil. The amplification was started
with 5 min at 94°C, followed by 30 cycles of 1 min at 94°C,
1 min at 55°C and 1 min at 72°C. After the final cycle the
elongation step was followed by another 5 min at 72°C. The
sequences of the primers that were used are:

MGPyr1: 5'-GCCAGTACACTACTTCTTCG-3' (see SEQ ID NO: 5)

MGPyr2: 5'-AGGAGATCGCGAGAAGGTTG-3' (see SEQ ID NO: 6)
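
As a rough plausibility check of the 55°C annealing step, the melting temperatures of the two primers can be estimated. The sketch below uses the Wallace rule of thumb (2°C per A/T, 4°C per G/C), which is an assumption introduced here for illustration and is not stated in the text; it is only a coarse approximation for short oligonucleotides.

```python
def wallace_tm(primer: str) -> int:
    """Estimate primer melting temperature (deg C) with the Wallace rule.

    Tm = 2 * (A + T) + 4 * (G + C). This is a rule of thumb for short
    oligonucleotides, not the method used in the original work.
    """
    seq = primer.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if at + gc != len(seq):
        raise ValueError("primer contains characters other than A, C, G, T")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    primers = {
        "MGPyr1": "GCCAGTACACTACTTCTTCG",
        "MGPyr2": "AGGAGATCGCGAGAAGGTTG",
    }
    for name, seq in primers.items():
        print(name, len(seq), "nt, estimated Tm", wallace_tm(seq), "deg C")
```

With this estimate both 20-mers come out at about 60°C and 62°C, so an annealing temperature of 55°C lies a few degrees below the estimated melting temperatures.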



For the Southern blot, approximately 2.5 µg of DNA was
digested with (a) restriction endonuclease(s) at 4 Units/µg
for 16 hours. The following restriction endonucleases were
used: Sau3AI, BglII, I-SceI and SalI. The DNA was
separated on a 0.8% agarose TBE gel and transferred to a
Hybond N membrane by capillary blotting (overnight). The
membrane was (pre-)hybridized according to the Hybond
protocol.

For the Southern blot presented in Figure 9 the chromosomal
DNA (7.5 µg) was digested with Sau3AI. The blot was probed
with an 18 bp end-labelled oligonucleotide representing the
I-SceI restriction site. This oligonucleotide was
end-labelled using T4 polynucleotide kinase and γ-32P-ATP.
Hybridization was carried out for 4 hours at 42°C. The
filter was washed for 5 minutes with 2 x SSC at 42°C
followed by another 5 minutes with 2 x SSC, 0.1% SDS at
42°C.

For the Southern blot presented in Figure 10 the
chromosomal DNA was digested with BglII or BglII and
I-SceI. The 2.4 kb BamHI x HindIII pyrG fragment from pAW4.1
was used as a probe. A DNA probe labelled with α-32P-dCTP was
obtained using the RTS RadPrime DNA Labelling System from
GibcoBRL (cat. no. 10387-017). The electronic autoradiographs
were obtained using an Instant Imager (Packard).

Example 1A. Experimental setup.

The experimental design of the process for site-directed
integration of multiple copies of a gene in the mould A.
awamori using the I-SceI endonuclease is shown in Figure 8.
The system is based on three components: a fungal strain
containing the target sequence with an I-SceI restriction
site, a repair construct that is carrying sequences
homologous to the target locus together with multiple
copies of (a) gene(s) encoding (a) desired protein(s), and
a plasmid containing an expression cassette with the gene
encoding the I-SceI endonuclease. In order to specifically
detect integration by homologous recombination we used the
endogenous pyrG gene as a selectable marker gene (Gouka et
al.; Curr. Genet. 27 (1995) 536-540). First, we constructed
a plasmid containing a defective pyrG gene in which a 0.8
kb region, encompassing the 3' end of the coding region and
flanking terminator sequences, was replaced by the I-SceI
restriction site (pUR5712). Using this plasmid the A.
awamori mutant pyrG- strain AWCSCE was constructed by a
selection strategy for gene replacement in fungi (see
Materials and Methods). Second, the repair construct
contains a complementary defective pyrG gene, in which 0.14
kb of the 5' end of the coding region is deleted, that has
sequences homologous to both sides of the I-SceI site at
the pyrG target locus in the A. awamori mutant pyrG- strain
AWCSCE. This repair construct contains a unique HindIII
restriction site that can be used for inserting multiple
copies of (a) gene(s) encoding (a) desired protein(s). The
complete insert of the repair construct is flanked by NotI
sites, which makes it possible to remove the vector
sequences and transform A. awamori with only the insert
fragment. Third, the expression cassette of the I-SceI gene
consists of the promoter from the efficiently expressed
glyceraldehyde-3-phosphate dehydrogenase encoding gene
from Aspergillus nidulans (gpdA) (Punt et al.; Gene 56
(1987) 117-124), an artificial I-SceI ORF (United States
Patent no. 5,474,896; filed Nov. 5, 1992) and the
transcription termination region of the A. nidulans trpC
gene (Mullaney et al.; Mol. Gen. Genet. 199 (1985) 37-45).
When the expression plasmid and the repair construct are
co-introduced into protoplasts of the strain AWCSCE,
transient expression of the I-SceI gene may lead to the
introduction of double-strand breaks at the I-SceI site,
thereby stimulating homologous recombination with the
repair construct and integration of the multiple gene
copies. Alternatively, the I-SceI endonuclease may be
introduced directly into the cell in a way that is similar
to Restriction Mediated Integration (REMI; Kuspa and
Loomis, Proc. Natl. Acad. Sci. USA 89 (1992) 8803-8807;
Redman and Rodriguez, Exp. Mycol. 18 (1994) 230-246;
Brenneman et al.; Proc. Natl. Acad. Sci. USA 93 (1996)
3608-3612). Because homologous recombination will restore an
intact pyrG gene, these events can be selected directly by
growing the transformed cells on MM plates without uridine.

Example 1B. Determination of the occurrence of a natural
I-SceI restriction site in the genome of A. awamori.

Before the system of this invention was set up, we
determined whether naturally occurring I-SceI restriction
sites are present in the genome of A. awamori. The I-SceI
endonuclease has an 18 bp recognition site, which will
statistically occur only once in 6.9 x 10^10 bp (4^18) or
approximately once in every 18,500 A. awamori genomes.
Therefore it seems unlikely that an I-SceI site will be
present in the genome.
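
The statistical estimate above can be reproduced with a short calculation. This is a minimal sketch under the simplifying assumptions that all four bases are equally frequent and independent and that only one strand is considered; the function names are hypothetical, and the genome size is left as a caller-supplied parameter because the text does not state the value it used.

```python
def expected_spacing_bp(site_length: int) -> int:
    """Average number of bp per random occurrence of a site of this length.

    Assumes equal, independent base frequencies (probability 1/4 per base).
    """
    return 4 ** site_length


def genomes_per_site(site_length: int, genome_size_bp: float) -> float:
    """Number of genomes one would have to scan, on average, to find one site.

    genome_size_bp must be supplied by the caller; it is not given in the text.
    """
    if genome_size_bp <= 0:
        raise ValueError("genome_size_bp must be positive")
    return expected_spacing_bp(site_length) / genome_size_bp


if __name__ == "__main__":
    spacing = expected_spacing_bp(18)
    print(f"4^18 = {spacing} bp (about {spacing:.1e} bp)")
```

For an 18 bp site this gives 4^18 = 68,719,476,736 bp, which rounds to the 6.9 x 10^10 bp quoted above.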

In order to determine the presence of a naturally occurring
I-SceI site, a Southern blot was performed with A. awamori
genomic DNA. The genomic DNA was digested with Sau3AI,
which does not cut within the I-SceI restriction site, to
create smaller fragments that are more suitable for
Southern blotting and hybridization with a labelled
oligonucleotide. Plasmid reconstructions with the plasmids
pUR5712 and pSCM522 (control DNA substrate containing the
I-SceI restriction site, supplied with the I-SceI
endonuclease from Boehringer Mannheim, cat. no. 1497235),
both containing a single I-SceI restriction site,
representing a single copy site or 20 or 200 sites per
genome were included as controls. The blot was probed with
an 18 bp end-labelled oligonucleotide representing the
I-SceI restriction site. For the autoradiograph see Figure 9.
Whereas the single copy reconstructions with the control
plasmids show a clear hybridizing fragment (lanes 3 and 6),
no hybridizing fragments are present in the lanes 4 and 5
containing the chromosomal A. awamori DNA. This result
demonstrates that the genome of A. awamori does not contain
a natural I-SceI restriction site. Thus, it is possible to
specifically engineer a unique I-SceI site into the genome
at a locus of choice and subsequently introduce a unique
double-strand break in the genomic DNA at that locus by
expressing the I-SceI gene in the cell or introducing the
I-SceI endonuclease itself.
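
The remark that Sau3AI does not cut within the I-SceI restriction site can be verified from the two recognition sequences. The sketch below is illustrative only (the function names are hypothetical); it checks for a Sau3AI site lying entirely inside the 18 bp sequence on either strand and does not consider sites that might be formed with unknown flanking sequence.

```python
I_SCEI_SITE = "TAGGGATAACAGGGTAAT"  # SEQ ID NO: 1
SAU3AI_SITE = "GATC"                # from the restriction-site table

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def cuts_within(enzyme_site: str, target: str) -> bool:
    """True if enzyme_site occurs fully inside target on either strand."""
    return enzyme_site in target or enzyme_site in reverse_complement(target)


if __name__ == "__main__":
    print("Sau3AI cuts within I-SceI site:", cuts_within(SAU3AI_SITE, I_SCEI_SITE))
    print("I-SceI site is palindromic:", I_SCEI_SITE == reverse_complement(I_SCEI_SITE))
```

The check also shows that the I-SceI recognition sequence is not palindromic, unlike the sites of the conventional restriction enzymes listed in Materials and Methods.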

Example 1C. Construction of the A. awamori mutant pyrG-
strain AWCSCE which contains an I-SceI restriction site at
the locus of the mutated pyrG gene.

The A. awamori mutant pyrG- strain AWCSCE is obtained by
replacing the chromosomal wild-type pyrG gene with the
mutant pyrG gene from pUR5712, which contains the I-SceI
restriction site at the site of the deletion. Therefore,
the A. awamori strain was transformed with a purified EcoRI
fragment containing the insert from the plasmid pUR5712.
Per transformation 2 x 10« protoplasts were transformed with
10 µg of DNA. In seven transformations a total of 11
5-FOA-resistant colonies were obtained. None of these
transformants was able to grow on MM plates without uridine,
corresponding with a pyrG- phenotype. Genomic DNA was isolated
from these strains and a PCR was performed with the primers
MGPyr1 and MGPyr2. These primers anneal within the regions
flanking the deletion and will generate a 1.13 kb fragment
when the wild-type pyrG gene is present and a 0.35 kb fragment
when the mutant pyrG gene is present. Two out of the 11
5-FOA-resistant colonies contained the 0.35 kb fragment
specific for the mutant pyrG gene and were further analyzed by
Southern blotting. The remaining 9 5-FOA-resistant colonies are
likely to be the result of spontaneous mutations. DNA was
digested with BglII and I-SceI and the blot was probed with a
2.4 kb BamHI x HindIII fragment from pAW4.1 containing the A.
awamori pyrG gene (Figure 10). In the DNA of both mutant
strains and the wild-type strain a
9.0 kb BglII fragment was present. The other 2.7 kb BglII
fragment, as present in the wild-type strain (lane 3), is
absent in the mutant strains and replaced by a 0.5 kb
BglII x I-SceI fragment characteristic for the mutant pyrG
gene carrying the I-SceI restriction site at the site of
the deletion. This result indicates that these mutants
originate from a replacement of the wild-type pyrG gene by
the mutated pyrG gene from pUR5712. Because the mutant
strain from lane 1 contains an additional unexplained
hybridizing fragment of approximately 2.0 kb, the mutant
strain from lane 2 was chosen for further experiments and
designated AWCSCE.
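
The two diagnostic PCR product sizes are consistent with the design of the deletion. The following sketch is an approximation for illustration: it assumes that exactly 800 bp are deleted and that only the 18 bp I-SceI site is inserted (the few extra bases of the flanking linker sites are ignored). The function names and the size tolerance are hypothetical.

```python
def mutant_amplicon_bp(wild_type_bp: int, deleted_bp: int, inserted_bp: int) -> int:
    """Expected PCR product size after replacing a deleted region by an insert."""
    return wild_type_bp - deleted_bp + inserted_bp


def genotype_from_amplicon(observed_bp: float,
                           wild_type_bp: int = 1130,
                           mutant_bp: int = 350,
                           tolerance_bp: int = 50) -> str:
    """Classify a colony from its PCR product size.

    tolerance_bp is an arbitrary illustrative allowance for gel sizing error.
    """
    if abs(observed_bp - wild_type_bp) <= tolerance_bp:
        return "wild-type pyrG"
    if abs(observed_bp - mutant_bp) <= tolerance_bp:
        return "mutant pyrG (I-SceI site)"
    return "unclear"


if __name__ == "__main__":
    print(mutant_amplicon_bp(1130, 800, 18), "bp expected for the mutant allele")
    print(genotype_from_amplicon(1130))
    print(genotype_from_amplicon(350))
```

Under these assumptions the mutant allele is expected to give a product of 348 bp, in line with the 0.35 kb fragment reported above.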

Example 1D. Induction of site-directed integration at the
pyrG locus by I-SceI expression.

Protoplasts derived from the A. awamori mutant pyrG- strain
AWCSCE were transformed with a repair construct in the
presence or absence of the I-SceI expression vector
pUR5724. The repair constructs consisted of a mutant
complementing pyrG gene (pUR5718), and derivatives thereof
containing a repeat of 4 copies (pUR5722) or 9 copies
(pUR5725) of a cutinase expression cassette. Prior to
transformation, DNA from the repair constructs was digested
with NotI. This released the insert from the vector
sequences, thereby creating ends that are homologous to the
target locus. The results of the transformation experiments
are shown in Table 1. The transformation frequencies were
calculated from parallel transformations with the positive
control construct pAW4.2, containing the wild-type pyrG
gene.

Transformation of pUR5718 without the I-SceI expression
vector pUR5724 yielded 16 recombinants corresponding to a
gene targeting frequency of 10.6%, whereas including the
I-SceI expression vector yielded 63 recombinants
corresponding to a gene targeting frequency of 41.7%. These
results indicate that gene targeting with the repair
construct pUR5718 is stimulated approximately four-fold by
the introduction of a double-strand break at the pyrG locus
in the strain AWCSCE.

Transformation of pUR5722, containing the repeat of 4
copies of the cutinase expression cassette, without the
I-SceI expression vector pUR5724 yielded no recombinants.
This means that the gene targeting frequency is lower than
3.6%. In contrast, transformation of pUR5722 with the
I-SceI expression vector pUR5724 yielded 5 recombinants
corresponding to a gene targeting frequency of 18%. These
results indicate that gene targeting of a repair construct
containing a tandem array of 4 copies of the cutinase gene
is about 2 to 3 fold less efficient than that of the repair
construct pUR5718 which only contains pyrG sequences.
Moreover, these results indicate that gene targeting with
the repair construct pUR5722 is stimulated at least
five-fold by the introduction of a double-strand break at the
pyrG locus in the strain AWCSCE.

Transformation of pUR5725, containing the repeat of 9
copies of the cutinase expression cassette, without the
I-SceI expression vector pUR5724 also yielded no
recombinants. This means that in this case the gene
targeting frequency is lower than 0.6%. In contrast,
transformation of pUR5725 with the
I-SceI expression vector pUR5724 yielded 2 recombinants
corresponding to a gene targeting frequency of only 0.4%.
These results indicate that gene targeting of a repair
construct containing a tandem array of 9 copies of the
cutinase gene is about 20 to 100 fold less efficient than
that of the repair construct pUR5718 which only contains
pyrG sequences.
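
The fold changes quoted in this Example follow from ratios of the reported gene targeting frequencies. The sketch below simply recomputes them; it is illustrative only, the names are hypothetical, and the values entered for transformations that yielded no recombinants are the upper bounds stated in the text (lower than 3.6% and lower than 0.6%), so ratios involving them are bounds rather than measurements.

```python
# Reported gene targeting frequencies in percent. "without"/"with" refers to
# co-transformation of the I-SceI expression vector pUR5724.
FREQ_PERCENT = {
    "pUR5718": {"without": 10.6, "with": 41.7},
    "pUR5722": {"without": 3.6, "with": 18.0},  # "without" is an upper bound
    "pUR5725": {"without": 0.6, "with": 0.4},   # "without" is an upper bound
}


def stimulation(construct: str) -> float:
    """Fold increase in targeting frequency caused by I-SceI co-transformation."""
    f = FREQ_PERCENT[construct]
    return f["with"] / f["without"]


def relative_efficiency(reference: str, construct: str, condition: str) -> float:
    """How many fold less efficient 'construct' is than 'reference'."""
    return FREQ_PERCENT[reference][condition] / FREQ_PERCENT[construct][condition]


if __name__ == "__main__":
    print("pUR5718 stimulation:", round(stimulation("pUR5718"), 1))
    print("pUR5722 stimulation (at least):", round(stimulation("pUR5722"), 1))
    for construct in ("pUR5722", "pUR5725"):
        for condition in ("without", "with"):
            ratio = relative_efficiency("pUR5718", construct, condition)
            print(construct, condition, "I-SceI:", round(ratio, 1), "fold lower")
```

The ratios come out at roughly 3.9 (pUR5718 stimulation), 5 (lower bound for pUR5722), 2.3 to 2.9 (pUR5722 relative to pUR5718) and about 18 to 104 (pUR5725 relative to pUR5718), matching the rounded figures in the text.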

In conclusion, these results demonstrate that gene
targeting with a repair construct containing multiple
copies of a heterologous gene was only possible when a
double-strand break was introduced at the pyrG locus in the
strain AWCSCE. Gene targeting of a repair DNA becomes much
less efficient when an increasing number of gene copies
is included in the repair construct. This further
confirms the problem of site-directed integration of
multiple gene copies as discussed in the section "Background
of the invention and prior art".

Example 1E. Southern blot analysis of recombination events

The wild-type pyrG phenotype of several recombinants was
confirmed by streaking the conidia on MM plates. This was
done for 9 and 10 recombinants obtained from
transformations with pUR5718 in the absence or presence of
the I-SceI expression vector, respectively, and all the
recombinants obtained from transformations with pUR5722 and
pUR5725. From ten of these transformants, conidia from
individual colonies were streaked again on MM plates.
Subsequently conidia were isolated and cultures were grown to
obtain mycelium for genomic DNA isolation. DNA isolation and
Southern analysis are described in Materials and Methods. The
genomic DNA was digested with BglII or SalI. The Southern blots
were probed with either the 2.4 kb BamHI x HindIII pyrG
fragment from pAW4.1, the 0.46 kb AflII x SacI fragment
encompassing the terminator region from the endoxylanase gene
(pUR5729), the 0.72 kb BamHI x SalI fragment encompassing the
I-SceI gene or the pJB8 vector. The experimental setup of the
Southern analysis is shown in Figure 11 and the autoradiographs
of the Southern blots hybridized with the pyrG and TexlA
probes are depicted in Figure 12A and B, respectively.
In the Southern blot of wild-type A. awamori genomic DNA
digested with SalI and probed with the 2.4 kb pyrG fragment
a 3.3 kb and a 3.8 kb fragment are present (Figure 12A, lane
4). In the mutant strain AWCSCE the 3.8 kb fragment is
replaced by a 3 kb fragment (Figure 12A, lane 5). In
recombinants obtained with the plasmid pUR5718 restoration
of an intact pyrG gene will lead to a replacement of the 3
kb fragment with a 3.7 kb fragment (Figure 12A, lanes 1-3).
The small difference in size of the latter fragment
compared to the same fragment in the wild-type strain is
caused by a small 0.15 kb marker deletion in the 3'
flanking sequence of the pyrG gene in the repair construct
pUR5718. Because there is no SalI site present within the
cutinase expression cassette, site-directed integration of
multiple cutinase gene copies in recombinants obtained with
the plasmids pUR5722 (four cutinase gene copies) and
pUR5725 (nine cutinase gene copies) is expected to lead to
a replacement of the 3 kb fragment with a 9.6 kb or a 17.1
kb fragment, respectively. This replacement is observed for
one pUR5725 recombinant (lane 7) and one pUR5722
recombinant (lane 10), which demonstrates the successful
one-step site-directed integration of multiple gene copies
in A. awamori. Unexpectedly, in the other recombinants one
or more copies of the cutinase gene have been lost during
recombination. The remaining pUR5722 recombinants contain
two copies of the cutinase gene (Figure 12A, lanes 9, 11 and
12) or only one copy (Figure 12A, lane 8). The other
pUR5725 recombinant (Figure 12A, lane 6) contains three
copies of the cutinase gene.
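
Because the cutinase expression cassette contains no SalI site, the number of integrated copies can be estimated from the size of the pyrG-hybridizing SalI fragment. The sketch below is illustrative only: it takes the 3.7 kb fragment of the pUR5718 recombinants as the zero-copy baseline and 1.5 kb as the cassette size, both figures taken from the text, and rounds to the nearest whole copy; the function name is hypothetical.

```python
def copies_from_sali_fragment(fragment_kb: float,
                              baseline_kb: float = 3.7,
                              cassette_kb: float = 1.5) -> int:
    """Estimate the cassette copy number from a SalI fragment size.

    baseline_kb: fragment size with a restored pyrG gene and no cassette.
    cassette_kb: size of one cutinase expression cassette.
    The result is rounded to the nearest integer, since fragment sizes
    read from a gel are approximate.
    """
    if fragment_kb < baseline_kb:
        raise ValueError("fragment is smaller than the zero-copy baseline")
    return round((fragment_kb - baseline_kb) / cassette_kb)


if __name__ == "__main__":
    for size in (3.7, 9.6, 17.1):
        print(f"{size} kb fragment: {copies_from_sali_fragment(size)} copies")
```

The expected 9.6 kb and 17.1 kb fragments correspond to four and nine copies, respectively; the remaining difference of about 0.1 kb relative to an exact multiple of 1.5 kb is within the rounding of the reported sizes.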

Figure 12B depicts the Southern blot of genomic DNA
digested with BglII and probed with the TexlA probe. This
probe hybridizes with the endogenous exlA gene and the
introduced multiple copies of the cutinase expression
cassette. Thus the wild-type strain, the mutant AWCSCE
strain and the recombinants obtained with pUR5718 contain
only a 6 kb BglII fragment representing the endogenous exlA
gene (lanes 1-5). Because the cutinase expression cassette
contains one BglII site, this digestion will generate a 1.5
kb fragment in the recombinants containing multiple copies
of the cutinase gene. The intensity of this repeat fragment
relative to the endogenous exlA fragment is an indication
for the number of cutinase gene copies that are present.
The recombinants obtained with pUR5725 (lanes 6 and 7)
contain the additional 1.5 kb fragment. The intensities of
the bands correspond well to the presence of three (lane 6)
or nine copies (lane 7) of the cutinase gene, as was also
determined by the total size of the fragments (Figure 12A).
The same is true for the recombinants obtained with
pUR5722. Due to the orientation of the cutinase gene copies
relative to the pyrG gene (see Figure 5) the BglII
digestion will generate the 1.5 kb repeat fragment and a 2
kb border fragment. The recombinant in lane 8 contains only
the repeat fragment, indicating the presence of one copy of
the cutinase gene. The recombinants in lanes 9, 11 and 12
contain a repeat fragment that has the same intensity as
the border fragment, which confirms the presence of two
copies of the cutinase gene. The recombinant in lane 10
contains a repeat fragment that is about three times more
intense than the border fragment, which confirms the
presence of four copies of the cutinase gene.

In order to determine if other DNA, such as cosmid vector
sequences, or the plasmid containing the I-SceI expression
cassette had co-integrated into the genome in the
recombinant lines, the Southern blots have also been probed
with the 0.72 kb BamHI x SalI fragment encompassing the
I-SceI gene (BglII digested DNA) or the pJB8 vector (SalI
digested DNA). These blots demonstrated that in none of the
recombinants, except for the pUR5725 recombinant containing
nine copies of the cutinase gene, other DNA had been
integrated (results not shown). This demonstrates that it
is possible to construct "food-grade" strains that contain
multiple copies of a gene without the presence of other
foreign DNA. It should be noted that in the experiments
described here the inserts of pUR5722 and pUR5725 had not
been purified from the vector DNA prior to transformation,
whereas this is possible. Moreover, it may be possible to
omit the use of the I-SceI expression vector by
transforming the endonuclease directly. These modifications
will further improve the selection of strains that do not
contain other foreign DNA.

This Example shows that the advantages of this process for
site-directed integration of multiple copies of a gene in a
mould are:

multiple gene copies can be introduced at a
predetermined locus in the genome;

the possibility to obtain site-directed integration
of multiple gene copies in the genome is
significantly improved by the introduction of a
specific double-strand break at the chromosomal
target in the mould cell; and

the method results in a mould strain without residues
of bacterial antibiotic resistance markers or other
bacterial sequences like origins of replication,
with the consequence that the resulting mould
strains or products derived therefrom are so-called
"food-grade" products.

(1) GENERAL INFORMATION:

(i) APPLICANT (except for U.S.A.): 

(A) NAME: Unilever N.V. 

(B) STREET: Weena 455

(ii) TITLE OF INVENTION: A process for site-directed
integration of multiple copies of a gene in a mould 

(iii) NUMBER OF SEQUENCES: 6

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: (not yet known) 

(2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: restriction site for I-SceI
endonuclease from Saccharomyces cerevisiae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TAGGGATAAC AGGGTAAT 18

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE:

(B) CLONE: synthetic linker containing EcoRI and
NotI restriction sites replacing an EcoRI/HindIII
polylinker fragment thereby destroying the HindIII site

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AATTCATGCG GCCGCTAGCT 20

(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer MGgpdl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GACAAGGTCG TTGCGTCAGT C 21 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer MGgpd2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGGGATCCTT CCATATGTGA TGTCTGCTCA AGCGG 35

(2) INFORMATION FOR SEQ ID NO: 5:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer MGPyrl 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GCCAGTACAC TACTTCTTCG 20 

(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 
(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer MGPyr2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGGAGATCGC GAGAAGGTTG 20 
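
The entries of the sequence listing can be cross-checked mechanically. The following is a minimal sketch, not part of the original listing: it transcribes the six sequences and their declared lengths, confirms that each sequence has the stated number of bases, and confirms that the linker of SEQ ID NO: 2 contains the NotI recognition sequence GCGGCCGC. The helper names (clean, check_lengths, has_site) are hypothetical and chosen for illustration only.

```python
import re

# Sequences and declared lengths transcribed from the listing above
# (SEQ ID NO -> (sequence as printed, declared length in bases)).
SEQUENCES = {
    1: ("TAGGGATAAC AGGGTAAT", 18),
    2: ("AATTCATGCG GCCGCTAGCT", 20),
    3: ("GACAAGGTCG TTGCGTCAGT C", 21),
    4: ("CGGGATCCTT CCATATGTGA TGTCTGCTCA AGCGG", 35),
    5: ("GCCAGTACAC TACTTCTTCG", 20),
    6: ("AGGAGATCGC GAGAAGGTTG", 20),
}

# Recognition sequence of NotI (a well-known 8-bp palindromic site).
NOTI_SITE = "GCGGCCGC"


def clean(seq: str) -> str:
    """Remove the block spacing used in the listing and upper-case the bases."""
    return re.sub(r"\s+", "", seq).upper()


def check_lengths(sequences=SEQUENCES) -> dict:
    """Return {SEQ ID NO: True/False} for 'actual length == declared length'."""
    return {
        seq_id: len(clean(seq)) == declared
        for seq_id, (seq, declared) in sequences.items()
    }


def has_site(seq: str, site: str) -> bool:
    """Report whether a recognition sequence occurs in the given strand."""
    return site in clean(seq)


if __name__ == "__main__":
    for seq_id, ok in check_lengths().items():
        print(f"SEQ ID NO: {seq_id} length consistent: {ok}")
    print("NotI site in SEQ ID NO: 2:", has_site(SEQUENCES[2][0], NOTI_SITE))
```

Such a check only compares the printed bases with the declared lengths; it says nothing about whether the sequences match the constructs actually used.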
CLAIMS

1. A process for site-directed integration of multiple 
copies of a gene in a mould, which comprises 

(i) providing a mould cell containing in its chromosomal
DNA a restriction site for a rare-cutting endonuclease,

(ii) transforming such mould cell with a piece of DNA 
comprising in the 5' to 3' direction in the following 
order 

(a) a first DNA fragment homologous to part of the
DNA upstream and in the neighbourhood of the
restriction site for the rare-cutting endonuclease
present in the chromosomal DNA of the mould,

(b) multiple copies of at least one expressible gene 
comprising a structural gene encoding a desired 
protein, 

(c) a second DNA fragment homologous to part of the
DNA downstream and in the neighbourhood of the
restriction site for the rare-cutting endonuclease
present in the chromosomal DNA of the mould,

while during the transformation of the mould the 
presence of the rare-cutting endonuclease is 
provided, 

(iii) selecting or screening for a mould cell in which the 
multiple gene copies of said expressible gene are inserted 
into the chromosomal DNA of the mould. 

2. A process according to claim 1, in which the
rare-cutting endonuclease is I-SceI.

3. A process according to claim 1, in which the restriction
site for the rare-cutting endonuclease has been introduced
at a desired locus.

4. A process according to claim 3, in which the desired
locus is within a selectable marker gene or in the 
neighbourhood thereof. 

5. A process according to claim 4, in which the piece of 
DNA comprises a third DNA fragment that completes any 
disrupted or partially deleted selectable marker gene in 
the chromosomal DNA. 

6. A process according to claim 4, in which the part of
the DNA upstream of the restriction site for the
rare-cutting endonuclease present in the chromosomal DNA of the
mould, to which the first DNA fragment is homologous, is
part of a selectable marker gene.

7. A process according to claim 4, in which the part of
the DNA downstream of the restriction site for the
rare-cutting endonuclease present in the chromosomal DNA of the
mould, to which the second DNA fragment is homologous, is
part of a selectable marker gene.

8. A process according to claim 1, in which the restriction
site for the rare-cutting endonuclease occurs
naturally in the chromosomal DNA of the mould.

9. A process according to claim 1, in which two or more
restriction sites for the rare-cutting endonuclease are
present.

10. A process according to claim 1, in which the expressible
gene comprises (1) a promoter operable in said mould,
(2) optionally a DNA fragment encoding a secretion signal
peptide facilitating the secretion of said desired protein
from said mould, (3) a structural gene encoding said
desired protein, and (4) optionally a terminator operable
in said mould, whereby the promoter and the optional
terminator control the expression of the structural gene.

11. A process according to claim 1, in which during the
transformation of the mould the rare-cutting endonuclease 
is provided by adding the endonuclease as such, and/or is 
formed in situ by co-transforming the mould with DNA
encoding the rare-cutting endonuclease, which DNA is to be 
expressed during or after the transformation of the mould. 

12. A process according to claim 1, in which the mould
belongs to the group of Eumycota, and preferably is
selected from the group consisting of the fungal
subdivisions Ascomycotina, Basidiomycotina, Deuteromycotina,
Mastigomycotina, and Zygomycotina.

13. A process according to claim 12, in which the mould 
is selected from the genus Aspergillus, and preferably 
belongs to the species Aspergillus awamori.

14. A transformed mould obtainable by a process as 
claimed in claim 1. 

15. A process for culturing a transformed mould obtained 
by a process as claimed in claim 1 or obtainable by such
process.

16. A process for producing and optionally secreting a 
desired protein by carrying out a process as claimed in 
claim 15 under conditions whereby the structural gene 
encoding said desired protein is expressed, and optionally 
isolating or concentrating the desired protein. 

Figure 1.
Figure 4.
Figure 6.
Figure 9.